THE DISTRIBUTION OF THE VARIANCE OF PRIMES IN 
ARITHMETIC PROGRESSIONS 



DANIEL FIORILLI 



Abstract. Hooley conjectured that the variance $V(x;q)$ of the distribution of primes up to $x$ in the arithmetic progressions modulo $q$ is asymptotically $x\log q$, in some unspecified range of $q \le x$. On average over $1 \le q \le Q$, this conjecture is known unconditionally in the range $x/(\log x)^A \le Q \le x$; this last range can be improved to $x^{1/2+\epsilon} \le Q \le x$ under the Generalized Riemann Hypothesis (GRH). We argue that Hooley's conjecture should hold down to $(\log\log x)^{1+o(1)} \le q \le x$ for all values of $q$, and that this range is best possible. We show under GRH and a linear independence hypothesis on the zeros of Dirichlet L-functions that for moderate values of $q$, $\phi(q)e^{-y}V(e^y;q)$ has the same distribution as that of a certain random variable of mean asymptotically $\phi(q)\log q$ and of variance asymptotically $2\phi(q)(\log q)^2$. Our estimates on the large deviations of this random variable allow us to predict the range of validity of Hooley's Conjecture.



1. Introduction 

Define the variance of the distribution of primes in arithmetic progressions as^1

$$V(x;q) := \sum_{\substack{a \bmod q \\ (a,q)=1}} \left| \psi(x;q,a) - \frac{\psi(x,\chi_0)}{\phi(q)} \right|^2.$$

The study of this important quantity has a long history. One of the major applications of the large sieve is the Barban-Davenport-Halberstam Theorem [Ba], [DH], [G], which asserts that the average of $V(x;q)$ over $1 \le q \le Q$ is $O(x\log Q)$, in the range $x/(\log x)^A \le Q \le x$. An asymptotic result in this range was first obtained by Montgomery [Mo1], who showed that for any fixed $A$, we have for $Q \le x$ that

$$\frac{1}{Q}\sum_{q \le Q} V(x;q) = x\log x + O_A\left( x\log(2x/Q) + \frac{x^2}{Q(\log x)^A} \right).$$

This estimate was refined by Hooley [H2], who showed that in the same range,

$$\frac{1}{Q}\sum_{q \le Q} V(x;q) = x\log Q - cx + O_A\left( Q^{1/4}x^{3/4} + \frac{x^2}{Q(\log x)^A} \right), \qquad (2)$$



where $c = \gamma + \log(2\pi) + 1 + \sum_p \frac{\log p}{p(p-1)}$. Hooley [H2], [H3] also showed that the error term in (2) could be replaced by $O_\epsilon(Q^{1/4}x^{3/4} + x^{3/2+\epsilon}/Q)$ under the Generalized Riemann Hypothesis (GRH), extending the range of validity of the asymptotic $\frac{1}{Q}\sum_{q \le Q} V(x;q) \sim x\log Q$ to $x^{1/2+\epsilon} \le Q \le x$. The most precise result so far under GRH is that of Goldston and Vaughan [GV], which states that for $Q \le x$,

$$\frac{1}{Q}\sum_{q \le Q} V(x;q) = x\log Q - cx + O_\epsilon\left( Q(x/Q)^{1/4+\epsilon} + \frac{x^{3/2}}{Q}(\log 2x)^{5/2}(\log\log 3x)^2 \right). \qquad (3)$$

^1 This variance is sometimes defined with $\psi(x;\chi_0)$ replaced by either $x$ or $\psi(x)$; however, these definitions are all equivalent for our purpose, which is to study the validity of the asymptotic $V(x;q) \sim x\log q$.

As for individual values of $q$, Hooley [H7], [H1] conjectured that in some range of $q \le x$ we have

$$V(x;q) \sim x\log q. \qquad (4)$$

It is still an open problem to determine whether (4) holds in any range of $q$. A lower bound of the conjectured order of magnitude was obtained by Friedlander and Goldston [FG1], who showed that in the range $x/(\log x)^A \le q \le x$,

$$V(x;q) \ge \left(\tfrac12 - o(1)\right) x\log q.$$

Hooley [H6] improved the range of validity of this bound to $x\exp(-c\sqrt{\log x}) \le q \le x$.

Similar bounds are known in a wider range of $q$ under GRH [FG1], [FG2], [H6]: the most precise lower bound appearing in the literature is

$$V(x;q) \ge \left(\frac32 - \frac1\alpha - o(1)\right) x\log q \qquad (5)$$

in the range $x^{2/3+\epsilon} \le q \le x$, where $\alpha = \log q/\log x$. As for upper bounds, Turán [Tu] has shown under GRH that $V(x;q) \ll x(\log x)^4$. Friedlander and Goldston [FG1] have shown that if in addition to GRH one assumes a strong version of the Hardy-Littlewood Conjecture on prime pairs, then (4) holds in the range $x^{1/2+\epsilon} \le q \le x$. More precisely, they show that if we assume the (ordinary) Riemann Hypothesis and we assume for fixed $\epsilon > 0$ and in the range $0 < |k| \le x$ that

$$\sum_{\max(0,-k) < n \le \min(x,x-k)} \Lambda(n)\Lambda(n+k) = \mathfrak{S}(k)(x - |k|) + O(x^{1/2+\epsilon}), \qquad (6)$$

where the singular series for prime pairs is defined by

$$\mathfrak{S}(k) := \begin{cases} 2\prod_{p>2}\left(1 - \frac{1}{(p-1)^2}\right)\prod_{p \mid k,\ p>2}\frac{p-1}{p-2} & \text{if } k \ne 0 \text{ is even}, \\ 0 & \text{if } k \text{ is odd}, \end{cases}$$

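then we have for $x^{1/2+\epsilon} \le q \le x$ the estimate

$$\frac{V(x;q)}{x} = \log q - \left(\gamma + \log(2\pi) + \sum_{p \mid q}\frac{\log p}{p-1}\right) + \dots \qquad (7)$$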

In the range x^~^ < g < x, their estimate takes the form V{x] q)/x = logg + 0((loglogg)^). 
Finally, Vaughan established upper bounds on the general $k$-th moment of the error term in (7) for $Q/2 < q \le Q$ in the range $x(\log x)^{-A} \le Q \le x$ (unconditionally) and $x^{3/4+\epsilon} \le Q \le x$ (under GRH).

We wish to emphasize that (3) gives an asymptotic result for $Q$ in the range $x^{1/2+\epsilon} \le Q \le x$, and that no asymptotic results are known in the range $Q \le x^{1/2}$, even conditionally. Moreover, (5) gives the correct order of magnitude of $V(x;q)$, provided $x^{2/3+\epsilon} \le q \le x$. Even under a strong version of the Hardy-Littlewood Conjecture, the best known range of validity of (4) is $x^{1/2+\epsilon} \le q \le x$. Therefore, the behaviour of $V(x;q)$ for $q \le x^{1/2}$ is a mystery and it is not clear whether an asymptotic formula such as (4) should hold in this range. As Friedlander and Goldston [FG1] put it,

    It may well be that these also hold for smaller q, but below $q = x^{1/2}$ we are
    somewhat skeptical.

In a recent paper, Keating and Rudnick [KR] (see also [K1], [K2]) established a function field analogue of Hooley's conjecture which suggests that (4) might hold in the extended range $x^{\epsilon} \le q \le x$, for any fixed $\epsilon > 0$.

In the current paper we establish a probabilistic result which suggests that (4) should hold all the way down to $q = (\log\log x)^{1+\delta}$, and should not hold in the range $q \le (\log\log x)^{1-\delta}$.

Conjecture 1.1. Fix $\delta > 0$. In the range $(\log\log x)^{1+\delta} \le q \le x$ we have

$$V(x;q) \sim x\log q.$$

To justify this conjecture, we will show in Proposition 1.7 that under GRH and a Linear Independence Hypothesis, the limiting logarithmic distribution of $\phi(q)V(x;q)/x$ coincides with the distribution of an explicit random variable $H_q$ (see Definition 2.1).

1.1. Analysis of the random variable $H_q$ under GRH. We analyze the random variable $H_q$ defined in (9), by first computing its mean and variance.

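Theorem 1.2. Assume GRH. Then the random variable $H_q$ defined in (9) satisfies

$$\mathbb{E}[H_q] = \phi(q)\log q\left(1 + O\left(\frac{\log\log q}{\log q}\right)\right), \qquad \mathrm{Var}[H_q] = 2\phi(q)(\log q)^2\left(1 + O\left(\frac{\log\log q}{\log q}\right)\right).$$

If we moreover assume that $L(\tfrac12,\chi) \ne 0$ (this is Chowla's Conjecture), then we have a more precise estimate for the mean: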




Hence, $H_q$ is a random variable which is concentrated about its mean $\mathbb{E}[H_q] \sim \phi(q)\log q$. In light of Proposition 1.7 this gives an intuitive reason why (4) should hold. Our main result is an estimate on the large deviations of $H_q$, which gives information on the range of validity of (4).

Theorem 1.3. Assume GRH, and let $H_q$ be the random variable defined in (9). If $q$ is large enough and $(\log\log q)^2/\log q \le \epsilon \le \epsilon_0$ ($\epsilon$ can depend on $q$), then we have the following bounds on the large deviations of $H_q$:

$$\exp(-c_1\epsilon^2\phi(q)) \le \mathrm{Prob}\big[|H_q - \phi(q)\log q| > \epsilon\phi(q)\log q\big] \le 2\exp(-c_2\epsilon^2\phi(q)), \qquad (8)$$

where $\epsilon_0$, $c_1$ and $c_2$ are absolute constants.
Remark 1.4. If in addition to GRH we assume that $L(\tfrac12,\chi) \ne 0$ and we replace $H_q - \phi(q)\log q$ with $H_q - \phi(q)\big(\log q - \gamma - \log(2\pi) - \sum_{p \mid q}\frac{\log p}{p-1}\big)$, then we can extend the range of $\epsilon$ to $\Psi(q)\log q/q \le \epsilon \le \epsilon_0$, where $\Psi(q)$ is any function tending to infinity with $q$ and $\epsilon_0$ is an absolute constant.

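Remark 1.5. What Theorem 1.3 roughly says is that the large deviations of $H_q$ are those of a normal distribution of mean $\mathbb{E}[H_q] \approx \phi(q)\log q$ and variance $\mathrm{Var}[H_q] \approx 2\phi(q)(\log q)^2$ (see Theorem 1.2). Indeed, such a distribution $Z_q \sim N(\phi(q)\log q, 2\phi(q)(\log q)^2)$ satisfies

$$\mathrm{Prob}\big[|Z_q - \phi(q)\log q| > \epsilon\phi(q)\log q\big] = \mathrm{Prob}\Big[|N(0,1)| > \epsilon\sqrt{\phi(q)/2}\Big] = \frac{2}{\sqrt{2\pi}}\int_{\epsilon\sqrt{\phi(q)/2}}^{\infty} e^{-t^2/2}\,dt \sim \frac{2\exp(-\epsilon^2\phi(q)/4)}{\epsilon\sqrt{\pi\phi(q)}}.$$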
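The Gaussian comparison in Remark 1.5 can be checked numerically. The sketch below assumes a normal variable with mean $\phi(q)\log q$ and variance $2\phi(q)(\log q)^2$, so that a deviation of $\epsilon\phi(q)\log q$ corresponds to $\epsilon\sqrt{\phi(q)/2}$ standard deviations; it evaluates the exact two-sided tail through the complementary error function and compares it with $\exp(-\epsilon^2\phi(q)/4)$. The function names are illustrative.

```python
import math

def gaussian_tail(eps, phi_q):
    """Prob[|Z - mu| > eps*mu] for Z ~ N(mu, 2*phi_q*(log q)^2), mu = phi_q*log q.

    The deviation equals t = eps*sqrt(phi_q/2) standard deviations, and
    Prob[|N(0,1)| > t] = erfc(t / sqrt(2)); log q cancels out.
    """
    t = eps * math.sqrt(phi_q / 2.0)
    return math.erfc(t / math.sqrt(2.0))

def exponential_scale(eps, phi_q):
    """The factor exp(-eps^2 * phi_q / 4) appearing in the Gaussian tail."""
    return math.exp(-eps ** 2 * phi_q / 4.0)

if __name__ == "__main__":
    for eps, phi_q in [(0.1, 400), (0.05, 10000), (0.2, 1000)]:
        print(eps, phi_q, gaussian_tail(eps, phi_q), exponential_scale(eps, phi_q))
```

Since $\mathrm{erfc}(u) \le e^{-u^2}$ for $u \ge 0$, the exact tail never exceeds the exponential factor, which matches the shape of the upper bound in (8).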

1.2. Relation between $V(x;q)$ and $H_q$. Assuming the following linear independence hypothesis and GRH, we will show that the distribution of $H_q$ coincides with the limiting logarithmic distribution of $\phi(q)V(x;q)/x$.

Hypothesis LI (Linear Independence). For any $q \ge 1$, the multiset $Z(q) := \{\Im(\rho) \ge 0 : L(\rho,\chi) = 0,\ \chi \bmod q,\ \Re(\rho) \ge \tfrac12\}$, that is the set of all non-negative imaginary parts of zeros of $L(s,\chi)$ with $\chi \bmod q$ and $\Re(s) \ge \tfrac12$, is linearly independent over $\mathbb{Q}$.

Hypothesis LI first appeared for $\zeta(s)$ in the work of Wintner [W] on the distribution of $x^{-1/2}(\psi(x) - x)$, and has subsequently been used by many authors for similar purposes [H5], [Mo2], [Mn]. It is now a standard hypothesis in the study of prime number races [RS], [FeMa], [Ma], [N1], [FiMa], [La1], [La2], [La3], [Fi1], [Fi2]. Partial results towards LI include the work of Martin and Ng in the case of Dirichlet L-functions [MaN] and the work of Kowalski in the context of hyperelliptic curves over finite fields [Ko].

Remark 1.6. Hypothesis LI implies that all nontrivial zeros of $L(s,\chi)$ are simple and do not lie on the real line.

Proposition 1.7. Assume GRH and LI. Then as $x \to \infty$, the limiting logarithmic distribution of $\phi(q)V(x;q)/x$ coincides with that of the random variable $H_q$.



Going back to Theorem 1.3 we see that under GRH and LI, the probability that $V(e^y;q) \not\sim e^y\log q$, that is the probability that for a fixed $0 < \epsilon \le \epsilon_0$ we have $|H_q - \phi(q)\log q| > \epsilon\phi(q)\log q$, is at most $2\exp(-c_2\epsilon^2\phi(q))$. Hence we expect that this event does not happen at all in the range $y = o(\exp(c_2\epsilon^2\phi(q)))$; this translates to the statement that $V(x;q) \sim x\log q$ in the range $(\log\log x)^{1+\delta} \le q \le x^{o(1)}$, justifying Conjecture 1.1. We will expand this argument in the concluding remarks.

Remark 1.8. It is interesting to note that the secondary term $-\gamma - \log(2\pi) - \sum_{p \mid q}\frac{\log p}{p-1}$ appearing in Theorem 1.2 is identical to the secondary term appearing in (7). Indeed, these terms come from quite different sources. The one appearing in the current paper comes from the following GRH estimate, with the $\gamma_\chi$ running through the imaginary parts of the non-trivial zeros of $L(s,\chi)$:

E E - ^(^) ( ^ - ^ - - E ^ + o ( (!^) ) , 

X mod <J 7x ^ pIq \ / / 

Xt^XO 



whereas the one in [FG1] comes from their Proposition 3, which is an estimate for the average of the Hardy-Littlewood singular series for prime pairs defined in (6). Their estimate takes the form

where Js{y, q) is an error term. 

1.3. Method of proof and possible extensions. The proof of Theorem 1.3 is based on the analytic properties of the moment-generating function (Laplace transform) of $H_q$. Previous estimates on large deviations of error terms of prime counting functions [Mo2], [Mn], [MoOd], [RS], [N1], [N2], [La2] are based on an explicit formula for the moment-generating function of the associated random variable which involves infinite products of Bessel functions. For example, under GRH and LI, the moment-generating function of the distribution of the error term $e^{-y/2}(\mathrm{Li}(e^y) - \pi(e^y))$ is given by



7>0 



2z 



where $I_0$ is the modified Bessel function and $\gamma$ runs through the imaginary parts of the nontrivial zeros of $\zeta(s)$. We could not use this approach in our analysis since such a nice closed form is not known for the moment-generating function of $H_q$. Instead we use that

$$H_q = \sum_{\chi \in C(q)} |Y_\chi|^2,$$

where $C(q)$ is a certain set of characters mod $q$ and the $Y_\chi$ are independent random variables whose real and imaginary parts have moment-generating functions which are known explicitly (see Lemma 3.4). From this we compute the moments of $H_q$ in terms of the moments of $Y_\chi$, which we then bound using complex analysis. The moment-generating functions of $\Re(Y_\chi)$ and $\Im(Y_\chi)$ are entire; this is a consequence of the fact that the $n$-th moments of $\Re(Y_\chi)$ and $\Im(Y_\chi)$ are bounded above by $n!^{1/2 + O_q(1/\log n)}$. From this we obtain a bound for the $n$-th moment of $H_q$ of order $n!^{1 + O_q(1/\log n)}$, which we believe is best possible (such is the case with the Gaussian). Hence we deduce the existence of the moment-generating function of $H_q$ inside the circle $|z| = (c\log q)^{-1}$. Using this information we give bounds on large deviations of $H_q$ by using a method similar to that used to prove the Bernstein Inequalities.

Remark 1.9. It is possible to say something about the limiting logarithmic distribution of $\phi(q)V(x;q)/x$ without the assumption LI. Indeed, one can adapt the techniques used in Section 2 of [Fi2] to show that this distribution has mean

( log , - 7 - iog(2.) -j:—,+o (<!^) ) . 

p\q ^ ^ 

under GRH and the assumption that the nontrivial zeros of

$$Z_q(s) := \prod_{\chi \bmod q} L(s,\chi)$$

are simple and nonreal. This is based on the fact that the calculation of the mean in the proof of Theorem 1.2 depends only on the fact that the $Z_{\gamma_\chi}$ have zero covariance.



Remark 1.10. One could ask if the methods of the current paper generalize to the study 
of the m-th moment 



a mod q 
{a,q)=\ 

The first problem of our approach to this problem for $m \ge 3$ is the lack of a nice formula such as (11), and hence it seems hard to turn this question into a question about sums of independent random variables. Another serious problem is that the method of moments would fail for $m = 3$. Indeed the formula analogous to (11) for $m = 3$ contains terms of the form $\psi(x,\chi)^3$ with cubic characters $\chi$, and the moments of these terms grow too fast for the moment method to be applied. This is analogous to the fact that if $Z$ is a standard Gaussian, then the distribution of $Z^3$ is indeterminate. Indeed, Berg [Be] has given an explicit infinite family of distinct random variables whose moments coincide with those of $Z^3$. Specifically, if $|r| \le \tfrac12$ is any real number, then the random variable whose probability density is

$$f(t) = \frac{1}{3\sqrt{2\pi}}\,|t|^{-2/3}e^{-|t|^{2/3}/2}\left(1 + r\left(\cos\Big(\frac{\sqrt3\,|t|^{2/3}}{2}\Big) - \sqrt3\,\sin\Big(\frac{\sqrt3\,|t|^{2/3}}{2}\Big)\right)\right)$$

has exactly the same moments as $Z^3$.

2. Link with random variables 

We now relate the study of $V(x;q)$ to that of the random variable $H_q$. As we will use orthogonality relations, it will be useful to treat real and complex characters separately. Throughout the paper, $C(q)$ will denote a fixed subset of the Dirichlet characters modulo $q$ such that $C(q)$ contains each non-principal real character, and contains exactly one of $\chi$ or $\bar\chi$ for complex characters $\chi$.

Definition 2.1. We define the random variables $Z_{\chi,\gamma_\chi} = Z_{\gamma_\chi}$ to be i.i.d. random variables uniformly distributed on the unit circle in $\mathbb{C}$, where $\chi$ runs over the set $C(q)$ and $\gamma_\chi$ runs over the imaginary parts of the nontrivial zeros of $L(s,\chi)$ (in the case $\chi$ is real, only positive imaginary parts will be considered). We also define

$$H_q := \sum_{\chi \in C(q)} |Y_\chi|^2, \qquad (9)$$

where

$$Y_\chi := \begin{cases} 2\sum_{\gamma_\chi > 0} \dfrac{\Re(Z_{\gamma_\chi})}{\sqrt{\frac14 + \gamma_\chi^2}} & \text{if } \chi \text{ is real}, \\[2ex] \sqrt2\sum_{\gamma_\chi \ne 0} \dfrac{Z_{\gamma_\chi}}{\sqrt{\frac14 + \gamma_\chi^2}} & \text{if } \chi \text{ is complex}. \end{cases} \qquad (10)$$
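To make Definition 2.1 concrete, here is a minimal simulation sketch. It assumes the reading of (10) in which $Y_\chi = 2\sum_{\gamma_\chi>0}\Re(Z_{\gamma_\chi})/\sqrt{1/4+\gamma_\chi^2}$ for real $\chi$ and $Y_\chi = \sqrt2\sum_{\gamma_\chi\ne0}Z_{\gamma_\chi}/\sqrt{1/4+\gamma_\chi^2}$ for complex $\chi$. The ordinates used below are made-up placeholder values, not actual zeros of any L-function, and the function names are illustrative.

```python
import cmath
import math
import random

def sample_Y(gammas, is_real, rng):
    """Draw one sample of Y_chi for a (truncated, hypothetical) list of ordinates.

    For real chi, gammas holds the positive ordinates; for complex chi,
    it holds all nonzero ordinates.
    """
    total = 0j
    for g in gammas:
        z = cmath.exp(2j * math.pi * rng.random())   # uniform on the unit circle
        w = 1.0 / math.sqrt(0.25 + g * g)
        total += 2.0 * z.real * w if is_real else math.sqrt(2.0) * z * w
    return total

def exact_second_moment(gammas):
    """E|Y_chi|^2 = 2 * sum 1/(1/4 + gamma^2) over the listed ordinates.

    Real case: 4 * Var[Re Z] = 2 per term; complex case: 2 * E|Z|^2 = 2 per term.
    """
    return 2.0 * sum(1.0 / (0.25 + g * g) for g in gammas)

if __name__ == "__main__":
    rng = random.Random(12345)
    gammas = [6.02, 10.24, 12.99]      # hypothetical ordinates, for illustration only
    n = 20000
    est = sum(abs(sample_Y(gammas, True, rng)) ** 2 for _ in range(n)) / n
    print(est, exact_second_moment(gammas))
```

Summing independent samples of $|Y_\chi|^2$ over a set of characters would then give samples of $H_q$, provided genuine zero data were supplied in place of the placeholder ordinates.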

Remark 2.2. It might be preferable to use the notation $Z_{\chi,\gamma_\chi}$ rather than $Z_{\gamma_\chi}$, since in the way we define these random variables we stipulate that if $\chi \ne \chi'$, then $Z_{\chi,\gamma_\chi}$ and $Z_{\chi',\gamma_{\chi'}}$ are independent, whatever $\gamma_\chi$ and $\gamma_{\chi'}$ are. We will keep the notation $Z_{\gamma_\chi}$ to be more concise.

Remark 2.3. The collection $\{Y_\chi : \chi \in C(q)\}$ is independent. This fact will be useful when computing the moments of $H_q$.



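Proof of Proposition 1.7. Using orthogonality relations, we compute

$$V(x;q) = \sum_{\substack{a \bmod q \\ (a,q)=1}} \Big|\frac{1}{\phi(q)}\sum_{\chi \ne \chi_0}\bar\chi(a)\psi(x,\chi)\Big|^2 = \frac{1}{\phi(q)^2}\sum_{\substack{\chi_1,\chi_2 \bmod q \\ \chi_1,\chi_2 \ne \chi_0}}\psi(x,\chi_1)\overline{\psi(x,\chi_2)}\sum_{\substack{a \bmod q \\ (a,q)=1}}\bar\chi_1(a)\chi_2(a) = \frac{1}{\phi(q)}\sum_{\substack{\chi \bmod q \\ \chi \ne \chi_0}}|\psi(x,\chi)|^2. \qquad (11)$$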
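The orthogonality identity $V(x;q) = \frac{1}{\phi(q)}\sum_{\chi\ne\chi_0}|\psi(x,\chi)|^2$ can be verified numerically. The sketch below is restricted, purely for simplicity, to prime moduli, where the characters can be written down from a primitive root; the helper names are illustrative and the restriction to primes is an assumption of the sketch, not of the paper.

```python
import cmath
import math

def mangoldt(n):
    """Lambda(n): log p if n is a power of the prime p, and 0 otherwise."""
    if n < 2:
        return 0.0
    for p in range(2, n + 1):
        if n % p == 0:               # p is the smallest prime factor of n
            while n % p == 0:
                n //= p
            return math.log(p) if n == 1 else 0.0

def characters_mod_prime(q):
    """All Dirichlet characters modulo a prime q, as dicts a -> chi(a).

    chars[0] is the principal character.
    """
    g = next(g for g in range(1, q)
             if len({pow(g, j, q) for j in range(q - 1)}) == q - 1)
    return [{pow(g, j, q): cmath.exp(2j * math.pi * j * k / (q - 1))
             for j in range(q - 1)} for k in range(q - 1)]

def both_sides(x, q):
    """Return (V(x;q), (1/phi(q)) * sum over chi != chi_0 of |psi(x,chi)|^2)."""
    lam = [mangoldt(n) for n in range(x + 1)]
    phi = q - 1
    chars = characters_mod_prime(q)
    psi_chi = [sum(lam[n] * chi[n % q] for n in range(1, x + 1) if n % q)
               for chi in chars]
    lhs = 0.0
    for a in range(1, q):
        psi_a = sum(lam[n] for n in range(1, x + 1) if n % q == a)
        lhs += abs(psi_a - psi_chi[0].real / phi) ** 2
    rhs = sum(abs(v) ** 2 for v in psi_chi[1:]) / phi
    return lhs, rhs

if __name__ == "__main__":
    print(both_sides(500, 11))
```

The two returned numbers should agree up to floating-point rounding.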



Applying GRH to the explicit formula, we obtain that 

X 



gi7xlogi- 2 



^ ' X mod g 7x ^ 



+ 0(v^(logx)^), 



so 



X mod q 7^ 
Xt^XO 



+ ^7x 



+ 0{4>{q)e 2?/3 



:i2i 



Using the fact that $L(s,\bar\chi) = \overline{L(\bar s,\chi)}$ and that real nontrivial zeros do not exist under LI, we transform the sum over zeros as follows:



E El 



X mod q 7^ 
Xt^XO 



+ ^7x 



Xt^XO 7x>0 
X real 



+ «7x 



«7x 



X complex 



E/ 1 _|_ 

7x7^0 2 + ^7x 



X6C{g) ,x 
X complex 

X mod q 7x>0 2 

Xt^XO 
X real 

2 



E Ei 

GC(g) 
X complex 



e«7x?/ 



xeC(g) 7x^0 2 + 



+ E Et 



-nx?/ 



XeC-Cg) 7x7^0 2 
X complex 



Hx 



* E E«i 



X mod g ^ 7x>0 2 
Xt^XO 
X real 



+ nx 



=»7xJ/ 



2 E Et^ 

xeC(g) 7x^0 2 + ^7x 
X complex 



Now $\left|\tfrac12 + i\gamma_\chi\right|^2 = \tfrac14 + \gamma_\chi^2$, and under LI, if we order the $\gamma_\chi$ appearing in the above sums by size, then for every fixed $k$ the vector $(e^{i\gamma_1 y}, \dots, e^{i\gamma_k y}) \in \mathbb{T}^k$ becomes equidistributed as $y \to \infty$ by the Kronecker-Weyl Theorem. The assertion follows similarly as in Section 2 of [RS] or Proposition 2.3 of [FiMa].



^2 Note that if $\chi$ is complex, then LI implies that the set of imaginary parts of nontrivial zeros of $L(s,\chi)$ is linearly independent. This follows from the fact that the zeros of positive imaginary part of both $L(s,\chi)$ and $L(s,\bar\chi)$ appear in the set $Z(q)$ in the statement of LI.

□



3. The first two moments of Hq 



In order to prove Theorem 1.2 we need several lemmas.

Lemma 3.1. Assume GRH and let $\chi \ne \chi_0$ be a character mod $q$. Then, letting $\gamma_\chi$ run over the imaginary parts of the nontrivial zeros of $L(s,\chi)$ we have

$$\sum_{\gamma_\chi}\frac{1}{\frac14 + \gamma_\chi^2} = \log\frac{q^*}{\pi} - \gamma - (1 + \chi(-1))\log 2 + 2\Re\frac{L'(1,\chi^*)}{L(1,\chi^*)} = \log q^* + O(\log\log q^*).$$

Proof. The first equality is Lemma 3.5 of [FiMa]. The second follows from applying Littlewood's GRH bound $L'(1,\chi)/L(1,\chi) \ll \log\log q^*$ (see [Li]) to the first. □

We will need a bound for the average of $2\Re(L'(1,\chi^*)/L(1,\chi^*))$ over $\chi \ne \chi_0$.
Lemma 3.2. Under GRH, the following holds: 

^ L'ii,x*) , (logg)^ 
^ L{i,x > q 

X mod q ^ ' 
Xt^XO 

Proof. We have 

E #fr- E ^x^m^^AM j2 x». (13) 

X mod q x mod q n n x mod q 

X^XO Xt^XO Xt^XO 

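Now, taking $r = 1$ in Proposition 3.4 of [FiMa] shows that for $e \ge 1$,

$$\sum_{\chi \bmod q}\big(\chi^*(p^e) - \chi(p^e)\big) = \begin{cases} \phi(q/p^\nu) & \text{if } p^\nu \,\|\, q,\ \nu \ge 1 \text{ and } p^e \equiv 1 \bmod q/p^\nu, \\ 0 & \text{otherwise}. \end{cases}$$

Therefore, denoting by $e(q;p)$ the least $e \ge 1$ such that $p^e \equiv 1 \bmod q/p^\nu$ (note that $p^{e(q;p)} > q/p^\nu$) we have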

n X mod q P"!!? ^ P^ll? ^ 

p==l mod g/p" !^>1 mod g/p" 

i/,e>l 

= ^ Mp ) logP fe,). « E pe(g;p) 

p Ik 
i/>i 



Moreover, 

n p'^llg l<e<i^ p\q 

Hence, (fT3|) becomes 

X mod q n \ mod g ^ n=l mod q n ^ 

X^XO Xt^XO 

where term on the right hand side should be interpreted as the hmit of the truncated sums. 
We first treat the values of n for which n> q^: 

AH rd{<P{q)i;{t-q,l)-m) 



V r, = 1 rr,r.A r, . „2 / 



n=l mod g n>q^ 



<l>iq)^it;q,l)-m °° ^ g, 1) - 



by GRH. As for the values n < g^, we have the following elementary estimates: 

n g? + 1 g 

n=l mod q '^<i<q 
n<q^ 



«;logg. 

n<g2 

We conclude that 

^ L\\,r) ^ .(logg)^ 

^ g 

X mod g ^ ' 
Xt^XO 

and the result follows from the bound 0(g) ^ qj log log g. □ 
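Lemma 3.3. For any $q \ge 3$,

$$\sum_{\substack{\chi \bmod q \\ \chi \ne \chi_0}}\log q^* = \phi(q)\Big(\log q - \sum_{p \mid q}\frac{\log p}{p-1}\Big) = \phi(q)\big(\log q + O(\log\log q)\big), \qquad (14)$$

$$\sum_{\substack{\chi \bmod q \\ \chi \ne \chi_0}}(\log q^*)^2 = \phi(q)(\log q)^2\Big(1 + O\Big(\frac{\log\log q}{\log q}\Big)\Big).$$

Proof. The first statement is Proposition 3.3 of [FiMa]. As for the second, we adapt Proposition 3.3 of [FiMa]. The arithmetical function $\Lambda_2(n) := \sum_{d \mid n}(\log d)^2\mu(n/d)$ is supported on integers having at most two prime factors, and takes the following values:

$$\Lambda_2(p^e) = (2e-1)(\log p)^2, \qquad \Lambda_2(p_1^{e_1}p_2^{e_2}) = 2\log p_1\log p_2. \qquad (15)$$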
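The values of $\Lambda_2$ stated in (15), and the claim about its support, can be checked directly from the definition $\Lambda_2(n) = \sum_{d\mid n}(\log d)^2\mu(n/d)$. The following sketch is a brute-force verification; the function names are illustrative.

```python
import math

def mobius(n):
    """Moebius function mu(n), by trial division."""
    result = 1
    p = 2
    while p * p <= n:
        if n % p == 0:
            n //= p
            if n % p == 0:      # square factor
                return 0
            result = -result
        p += 1
    if n > 1:                   # one remaining prime factor
        result = -result
    return result

def lambda2(n):
    """Lambda_2(n) = sum over d | n of (log d)^2 * mu(n/d)."""
    return sum(mobius(n // d) * math.log(d) ** 2
               for d in range(1, n + 1) if n % d == 0)

if __name__ == "__main__":
    print(lambda2(8), 5 * math.log(2) ** 2)             # prime power, e = 3
    print(lambda2(12), 2 * math.log(2) * math.log(3))   # two distinct primes
    print(lambda2(30))                                  # three primes: vanishes
```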



Following (3.2) of |FiMa] . we compute 



<i\q X mod <i 

E E^2(g/rf) 

X mod q d\q 
q'\d 

X mod g 

E (M^r 

X mod g 

0(g) (log g)' -2 log g logg*+ E (log^*)'- 

X mod q x mod g 



Combining this with (14) shows that

$$\sum_{\chi \bmod q}(\log q^*)^2 = \phi(q)(\log q)^2 + O\Big(\phi(q)\log q\log\log q + \sum_{d \mid q}\phi(d)\Lambda_2(q/d)\Big),$$

and so the last step is to show that $\sum_{d \mid q}\phi(d)\Lambda_2(q/d) \ll \phi(q)\log q\log\log q$. Arguing as in Lemma 3.2 of [FiMa] and using (15) we compute
r-1

d\q p^\\q k=0 



pi<P2 0<A:2<r-2-l 
r-1 



+ 2 Yl E logPllogP20(pt^)0(P2^; 



pi<P2 0<fc2<r2-l 



P' 119 



^ 1\ p'-+i+pr-_p(^2r + l) + 2r-l 2r - 1 
p/ (p — 1)^ p 



J^0(g/p'')(logp)' 

+ 2 ^ 0(g/(p;'>2'))logPilogp2P?"V2'~^ 



^'l II ^^9 II 

Pi ll'?>P2 ll"? 
Pl<P2 



2 



«^0(g//)(logl^)'(p^'"' + ^)+0(g) «0(?)logg, 



P 119 \ Pl9 

completing the proof. 



□ 



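For a real-valued random variable $W$, we will use the following notation for its moment-generating function:

$$\mathcal{L}_W(z) := \mathbb{E}[e^{zW}].$$

Lemma 3.4. Assume GRH and define $Y_\chi$ as in (10). Then for real characters $\chi$, the moment-generating function of $Y_\chi$ is an even function of $z$ given by

$$\mathcal{L}_{Y_\chi}(z) = \prod_{\gamma_\chi > 0} I_0\Big(\frac{2z}{\sqrt{\frac14 + \gamma_\chi^2}}\Big), \qquad (16)$$

where $\gamma_\chi$ runs over the imaginary parts of the nontrivial zeros of $L(s,\chi)$ and $I_0$ is the modified Bessel function of the first kind:

$$I_0(z) := \sum_{n=0}^{\infty}\frac{1}{n!^2}\Big(\frac{z}{2}\Big)^{2n}.$$

If $\chi$ is complex, then

$$\mathcal{L}_{\Re(Y_\chi)}(z) = \mathcal{L}_{\Im(Y_\chi)}(z) = \prod_{\gamma_\chi \ne 0} I_0\Big(\frac{\sqrt2\,z}{\sqrt{\frac14 + \gamma_\chi^2}}\Big). \qquad (17)$$

Proof. First note that the $Z_{\gamma_\chi}$ appearing in (10) are independent, and thus if $\chi$ is real, then

$$\mathcal{L}_{Y_\chi}(z) = \mathbb{E}\Big[\prod_{\gamma_\chi > 0}\exp\Big(\frac{2z\,\Re(Z_{\gamma_\chi})}{\sqrt{\frac14 + \gamma_\chi^2}}\Big)\Big] = \prod_{\gamma_\chi > 0}\mathcal{L}_{\Re(Z_{\gamma_\chi})}\Big(\frac{2z}{\sqrt{\frac14 + \gamma_\chi^2}}\Big).$$

The proof of (16) follows since the moment-generating function of $\Re(Z_{\gamma_\chi})$ is easily computed using the following integral representation of the Bessel $I_0$ function:

$$I_0(z) = \frac{1}{\pi}\int_0^{\pi} e^{z\cos\theta}\,d\theta.$$

(See Proposition 2.13 of [FiMa] for a similar derivation of the characteristic function of $Y_\chi$.) The proof of (17) is similar. □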
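The key step in this proof is that for $Z$ uniform on the unit circle, $\mathbb{E}[e^{z\Re(Z)}] = \frac1\pi\int_0^\pi e^{z\cos\theta}\,d\theta = I_0(z)$. The sketch below compares the power series of $I_0$ with a midpoint-rule evaluation of the integral; the function names and the truncation parameters are illustrative choices.

```python
import math

def bessel_i0_series(z, terms=60):
    """I_0(z) = sum over n >= 0 of (z/2)^(2n) / (n!)^2, truncated."""
    return sum((z / 2.0) ** (2 * n) / math.factorial(n) ** 2
               for n in range(terms))

def mgf_real_part_uniform(z, steps=2000):
    """E[exp(z * Re(Z))] for Z uniform on the unit circle.

    Equals (1/pi) * integral over [0, pi] of exp(z cos(theta)),
    approximated here by the midpoint rule.
    """
    h = math.pi / steps
    total = sum(math.exp(z * math.cos((k + 0.5) * h)) for k in range(steps))
    return total * h / math.pi

if __name__ == "__main__":
    for z in (0.5, 1.0, 1.5, 3.0):
        print(z, bessel_i0_series(z), mgf_real_part_uniform(z))
```

Both columns should agree to high precision; since $I_0$ is even in $z$, this also illustrates why the moment-generating function in (16) is even.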

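We are now ready to prove Theorem 1.2.

Proof of Theorem 1.2. We start with the mean, which by (9) equals

$$\mathbb{E}[H_q] = \sum_{\chi \in C(q)}\mathbb{E}[|Y_\chi|^2] = \sum_{\chi \in C(q)}\mathbb{E}[\Re(Y_\chi)^2 + \Im(Y_\chi)^2] = \sum_{\chi \in C(q)}\big(\mathrm{Var}[\Re(Y_\chi)] + \mathrm{Var}[\Im(Y_\chi)]\big), \qquad (18)$$

since we easily get from (10) that $\mathbb{E}[\Re(Y_\chi)] = \mathbb{E}[\Im(Y_\chi)] = 0$. Moreover, since the random variables $Z_{\gamma_\chi}$ appearing in (10) are independent and since one easily computes that $\mathrm{Var}[\Re(Z_{\gamma_\chi})] = \tfrac12$, we have for real $\chi$ that

$$\mathrm{Var}[Y_\chi] = 4\sum_{\gamma_\chi > 0}\frac{\mathrm{Var}[\Re(Z_{\gamma_\chi})]}{\frac14 + \gamma_\chi^2} = \sum_{\gamma_\chi}\frac{1}{\frac14 + \gamma_\chi^2} - 4m_\chi, \qquad (19)$$

where $m_\chi$ is the order of vanishing of $L(s,\chi)$ at $s = \tfrac12$. As for complex $\chi$, we have

$$\mathrm{Var}[\Re(Y_\chi)] = \mathrm{Var}[\Im(Y_\chi)] = 2\sum_{\gamma_\chi \ne 0}\frac{\mathrm{Var}[\Re(Z_{\gamma_\chi})]}{\frac14 + \gamma_\chi^2} = \sum_{\gamma_\chi}\frac{1}{\frac14 + \gamma_\chi^2} - 4m_\chi. \qquad (20)$$

Hence, combining (18), (19) and (20),

$$\mathbb{E}[H_q] = \sum_{\substack{\chi \bmod q \\ \chi \ne \chi_0 \\ \chi \text{ real}}}\sum_{\gamma_\chi}\frac{1}{\frac14 + \gamma_\chi^2} - 4\sum_{\substack{\chi \bmod q \\ \chi \ne \chi_0 \\ \chi \text{ real}}} m_\chi + 2\sum_{\substack{\chi \in C(q) \\ \chi \text{ complex}}}\sum_{\gamma_\chi}\frac{1}{\frac14 + \gamma_\chi^2} - 8\sum_{\substack{\chi \in C(q) \\ \chi \text{ complex}}} m_\chi$$

$$= \sum_{\substack{\chi \bmod q \\ \chi \ne \chi_0}}\sum_{\gamma_\chi}\frac{1}{\frac14 + \gamma_\chi^2} - 4\sum_{\substack{\chi \bmod q \\ \chi \ne \chi_0}} m_\chi, \qquad (21)$$

by definition of $C(q)$ and by the fact that $L(s,\bar\chi) = \overline{L(\bar s,\chi)}$. Taking

$$f(x) := \begin{cases} 1 - |x| & \text{if } |x| \le 1, \\ 0 & \text{otherwise} \end{cases}$$

in Theorem 1.3 of [FiMi] (this also follows from Theorem 1.4 of [GJMMNPP]) shows that

$$\sum_{\substack{\chi \bmod q \\ \chi \ne \chi_0}} m_\chi \ll \phi(q). \qquad (22)$$

(Since we are assuming GRH, the sum of the multiplicities at the central point is bounded by the 1-level density of low-lying zeros.) The first estimate for $\mathbb{E}[H_q]$ follows from combining (21) with Lemmas 3.1 and 3.3. As for the second (note that we are now assuming Chowla's Conjecture, so $m_\chi = 0$), we combine (21) with the exact formula in Lemma 3.1 to obtain

$$\mathbb{E}[H_q] = \sum_{\substack{\chi \bmod q \\ \chi \ne \chi_0}}\Big(\log\frac{q^*}{\pi} - \gamma - (1 + \chi(-1))\log 2 + 2\Re\frac{L'(1,\chi^*)}{L(1,\chi^*)}\Big)$$

$$= \phi(q)\Big(\log q - \gamma - \log(2\pi) - \sum_{p \mid q}\frac{\log p}{p-1}\Big) + O(1) + 2\Re\sum_{\substack{\chi \bmod q \\ \chi \ne \chi_0}}\frac{L'(1,\chi^*)}{L(1,\chi^*)}$$

by Lemma 3.3. The desired estimate follows by applying Lemma 3.2.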

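As for $\mathrm{Var}[H_q]$, we have by (9) and by the independence of the $Y_\chi$ that

$$\mathrm{Var}[H_q] = \sum_{\chi \in C(q)}\mathrm{Var}[|Y_\chi|^2] = \sum_{\chi \in C(q)}\big(\mathbb{E}[|Y_\chi|^4] - \mathbb{E}[|Y_\chi|^2]^2\big). \qquad (23)$$

If $\chi$ is real, then the moments of $Y_\chi$ can be extracted from its moment-generating function which we obtained in Lemma 3.4 (note that this function is even):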



7x>0 ^\ \^lV 




2 



and so 




(-1 I yi\ 



4! 



(24) 



X'' 



2 



3(logg*)^ + 0(logg* log log g*) + 0(m^ 



by Lemma 3.1 and the Riemann-von Mangoldt Formula.

As for complex characters $\chi$, we have by the definition of $Y_\chi$ that summing over the zeros of $L(s,\chi)$,



eM=4 Y 



E ^Z^-^ ^"y^ 



74 J 



71,72,73,74 ^0 (i + + 7|)^(i + 7|)^(| + 7|)^ 

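Moreover, by independence of the $Z_\gamma$ and since $\mathbb{E}[Z_\gamma] = 0$ and $|Z_\gamma| = 1$, we have that

$$\mathbb{E}\big[Z_{\gamma_1}Z_{\gamma_2}\overline{Z_{\gamma_3}}\,\overline{Z_{\gamma_4}}\big] = \begin{cases} 1 & \text{if } \gamma_1 = \gamma_2 = \gamma_3 = \gamma_4, \\ 0 & \text{if exactly three of the } \gamma_i \text{ are equal}, \\ 1 & \text{if } \gamma_1 = \gamma_3 \ne \gamma_2 = \gamma_4 \text{ or } \gamma_1 = \gamma_4 \ne \gamma_2 = \gamma_3, \\ 0 & \text{if } \gamma_1 = \gamma_2 \ne \gamma_3 = \gamma_4, \\ 0 & \text{otherwise}. \end{cases}$$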
Hence,

= 4^77^ + 8 E (1 ,^2. =^i^ogqr + 0{ml + logq*) (25) 

717^72 

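by Lemma 3.1 and the Riemann-von Mangoldt Formula. Combining this with (23) and the previous calculation of $\mathbb{E}[|Y_\chi|^2]$ in (18) and (20) we obtain that

$$\mathrm{Var}[H_q] = \sum_{\substack{\chi \bmod q \\ \chi \text{ real}}}\big(3(\log q^*)^2 - (\log q^*)^2 + O(m_\chi^2 + \log q^*\log\log q^*)\big) + \sum_{\substack{\chi \in C(q) \\ \chi \text{ complex}}}\big(8(\log q^*)^2 - 4(\log q^*)^2 + O(m_\chi^2 + \log q^*\log\log q^*)\big)$$

$$= \sum_{\substack{\chi \bmod q \\ \chi \ne \chi_0}} 2(\log q^*)^2 + O(\phi(q)\log q\log\log q),$$

since

$$\sum_{\substack{\chi \bmod q \\ \chi \ne \chi_0}} m_\chi^2 \le \big(\max_\chi m_\chi\big)\sum_{\substack{\chi \bmod q \\ \chi \ne \chi_0}} m_\chi \ll \log q \cdot \phi(q)$$

by the Riemann-von Mangoldt Formula and (22). The result follows from Lemma 3.3. □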



4. Large deviations of Hq 



One would like to apply the existing results on large deviations such as the Montgomery-Odlyzko bounds [MoOd] to our question. Expanding the square in (9) we obtain



Xmodg7x>0 4 'X x mod g 7;,>A^>0 W 4 + 7x\/ 4 + 
X real x I'^^il 
I 7 |2 v '7 



xeC(g) 7x^0 4 + 7x ^^^^0 J | + llJ \ + 

X complex x complex 7x>-^x 

At this point we run into the problem that the random variables in this expression are not all mutually independent, hence this sum of random variables does not satisfy the hypotheses of classical theorems on large deviations. We will use an alternative approach based on bounds on the moment-generating function (Laplace transform) of $H_q$, which we will then transfer into bounds on large deviations of $H_q$.

While the moment-generating functions of $\Re(Y_\chi)$ and $\Im(Y_\chi)$ can be computed explicitly in terms of Bessel functions (see Lemma 3.4), we were not able to find such a nice closed formula for $H_q$. We begin this section with an effective Stirling Formula.

Lemma 4.1 (Stirling's Formula). We have for $n \ge 2$ that

$$2.506\ldots = \sqrt{2\pi} \le \frac{n!}{\sqrt{n}\,(n/e)^n} \le \sqrt{2\pi}\,e^{1/24} = 2.613\ldots$$

Proof. See 6.1.42 of [AS]. □
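The two-sided bound of Lemma 4.1 is easy to check numerically. The sketch below evaluates the ratio $n!/(\sqrt{n}\,(n/e)^n)$ through `math.lgamma` to avoid overflow; the function name is illustrative.

```python
import math

def stirling_ratio(n):
    """n! / (sqrt(n) * (n/e)^n), computed via logarithms."""
    log_ratio = math.lgamma(n + 1) - 0.5 * math.log(n) - n * (math.log(n) - 1.0)
    return math.exp(log_ratio)

if __name__ == "__main__":
    lower = math.sqrt(2 * math.pi)                       # 2.506...
    upper = math.sqrt(2 * math.pi) * math.exp(1 / 24)    # 2.613...
    for n in (2, 3, 10, 100, 1000):
        print(n, lower, stirling_ratio(n), upper)
```

The ratio decreases towards $\sqrt{2\pi}$ as $n$ grows, and its largest value on $n \ge 2$ is attained at $n = 2$.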



Lemma 4.2. Assume GRH and let $Y_\chi$ be the random variable defined in (10). We have for $q^*$ large enough and for $n \ge 1$ that

$$\mathbb{E}[|Y_\chi|^{2n}] \le 5.7\,n^{1/2}\Big(\frac{4n\log q^*}{e - o(1)}\Big)^n.$$

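Proof. We will use the explicit formula for the moment-generating functions of $\Re(Y_\chi)$ and $\Im(Y_\chi)$ appearing in Lemma 3.4. Note that $I_0(z)$ is an entire function of $z$, and so the absolutely convergent product (16) is also an entire function. The Taylor series of $I_0(z)$ gives the following immediate bound:

$$|I_0(z)| \le \sum_{n=0}^{\infty}\frac{1}{n!^2}\Big(\frac{|z|}{2}\Big)^{2n} \le \sum_{n=0}^{\infty}\frac{1}{n!}\Big(\frac{|z|^2}{4}\Big)^{n} = e^{|z|^2/4}.$$

For real characters $\chi$, this gives a bound on $\mathcal{L}_{Y_\chi}(z)$, since by (16),

$$|\mathcal{L}_{Y_\chi}(z)| \le \exp\Big(\sum_{\gamma_\chi > 0}\frac{|z|^2}{\frac14 + \gamma_\chi^2}\Big) \le \exp\Big(\frac{|z|^2}{2}\big(\log q^* + O(\log\log q^*)\big)\Big)$$

for $q^*$ large enough, by Lemma 3.1. We now use this to bound the moments of $Y_\chi$. Cauchy's formula for the derivatives reads

$$\mathbb{E}[Y_\chi^{2n}] = \frac{(2n)!}{2\pi i}\oint_{|z| = C}\mathcal{L}_{Y_\chi}(z)\,\frac{dz}{z^{2n+1}},$$

and so by our bound on $\mathcal{L}_{Y_\chi}(z)$,

$$|\mathbb{E}[Y_\chi^{2n}]| \le (2n)!\exp\Big(\frac{C^2}{2}\big(\log q^* + O(\log\log q^*)\big)\Big)C^{-2n}.$$

Taking $n \ge 1$ and $C = (2n/\log q^*)^{1/2}$, we obtain

$$\mathbb{E}[Y_\chi^{2n}] \le (2n)!\Big(\frac{e^{1+o(1)}\log q^*}{2n}\Big)^n,$$

which by applying Lemma 4.1 gives the bound

$$\mathbb{E}[Y_\chi^{2n}] \le 3.7\,n^{1/2}\Big(\frac{2n\log q^*}{e - o(1)}\Big)^n.$$

If $\chi$ is complex, then we apply the above argument to the moment-generating function of $\Re(Y_\chi)$ and $\Im(Y_\chi)$ (see (17)). Doing so, we obtain the following bound:

$$\mathbb{E}[\Re(Y_\chi)^{2n}],\ \mathbb{E}[\Im(Y_\chi)^{2n}] \le 3.7\,n^{1/2}\Big(\frac{2n\log q^*}{e - o(1)}\Big)^n.$$

We finish the proof by combining this with Lemma 4.1 and the Cauchy-Schwarz inequality in the form $|\mathbb{E}[XY]| \le \mathbb{E}[X^2]^{1/2}\mathbb{E}[Y^2]^{1/2}$ for real random variables $X, Y$: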



2kc^(yr ■|2(n-fc)j 



< 



< 



fc=0 ^ ^ 



n-1 



5.7$: 



(n/e)"'n2 



Ak log q* 

\ {klefk^{{n - k)/eY-^{n - k)^'" - o{l) 
An log q' 



{n — k) 



A{n — k) logg* 
e-o(l) 



n—k 



< 



+ 2 -3.75(272)4 
' An log q 



e-o(l) 



5.7y^n^ 



fc=i 



e-o 1 



5.7n2 



An log g* 
e-o(l) 



□ 



Remark 4.3. An important fact used in the last proof is that $\mathcal{L}_{\Re(Y_\chi)}(z)$ and $\mathcal{L}_{\Im(Y_\chi)}(z)$ are entire functions, which we integrated on the circle $|z| = (2n/\log q^*)^{1/2}$, whose radius tends to infinity with $n$. This would not have worked with the cumulant-generating function $\log\mathbb{E}[e^{z\Re(Y_\chi)}]$, which has poles by (16) since $I_0(z)$ has infinitely many zeros on the imaginary axis.

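Now that we have bounded the moments of $|Y_\chi|^2$, we will turn this information into a bound on $\mathcal{L}_{\tilde H_q}(z)$, the moment-generating function of $\tilde H_q$. Instead of studying the moments of $H_q$ itself, we will study its centered moments, by defining

$$\tilde H_q := H_q - \mathbb{E}[H_q] = \sum_{\chi \in C(q)} W_\chi,$$

where (see (9))

$$W_\chi := |Y_\chi|^2 - \mathbb{E}[|Y_\chi|^2]. \qquad (26)$$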

Lemma 4.4. Assume GRH. For $q$ large enough, the moment-generating function of $\tilde H_q$ satisfies, in the range $|t| \le (40\log q)^{-1}$,

$$\mathcal{L}_{\tilde H_q}(t) \le \big(1 + 184t^2(\log q)^2\big)^{\phi(q)}. \qquad (27)$$

Proof. Let $0 \le t \le (40\log q)^{-1}$. Using the identity $\mathcal{L}_{X+c}(z) = e^{cz}\mathcal{L}_X(z)$, we have for $q$ large enough and $W_\chi$ defined as in (26) that



oo 



4-n 



XI 

n=0 



< 



J.71 

:i - tE[\Y^\'] + 0.5Qlt'E[\Y^\r) J2 ^[l^xl'l;^ 



(28) 



n=0 



n=2 



n\ (n-1)! (n-2)! 



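Now, by Lemma 4.2 we have for $n \ge 1$ that

$$\mathbb{E}[|Y_\chi|^{2n}] \le 5.7\,n^{1/2}\Big(\frac{4n\log q}{e - o(1)}\Big)^n.$$

Hence, applying Lemma 4.1 we obtain

$$\mathcal{L}_{W_\chi}(t) \le 1 + 4.65\sum_{n=2}^{\infty} n\big((4 + o(1))\,t\log q\big)^n \le 1 + 184t^2(\log q)^2,$$

as long as $0 \le t \le (40\log q)^{-1}$. Since the $W_\chi$ are all mutually independent, the proof follows by multiplicativity:

$$\mathcal{L}_{\tilde H_q}(t) = \prod_{\chi \in C(q)}\mathcal{L}_{W_\chi}(t).$$

A similar argument works in the range $-(40\log q)^{-1} \le t < 0$.

□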

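Proof of Theorem 1.3, upper bound. First, by Theorem 1.2 we have for $q$ large enough and for $(\log\log q)^2/\log q \le \epsilon \le 1$ (recall $\tilde H_q = H_q - \mathbb{E}[H_q]$) that

$$\mathrm{Prob}[H_q > (1+\epsilon)\phi(q)\log q] \le \mathrm{Prob}[\tilde H_q > 0.99\,\epsilon\phi(q)\log q].$$

As is customary (this is Chernoff's inequality), we relate the large deviations of $\tilde H_q$ to its moment-generating function using Markov's inequality:

$$\mathrm{Prob}[\tilde H_q > V] = \mathrm{Prob}[e^{t\tilde H_q} > e^{tV}] \le e^{-tV}\mathbb{E}[e^{t\tilde H_q}].$$

Taking $V = 0.99\,\epsilon\phi(q)\log q$ we obtain that

$$\mathrm{Prob}[H_q > (1+\epsilon)\phi(q)\log q] \le \exp(-0.99\,t\epsilon\phi(q)\log q)\,\mathcal{L}_{\tilde H_q}(t),$$

which from Lemma 4.4 is, for $t = \epsilon/(370\log q)$ (we use that $\epsilon \le 9$),

$$\le \exp(-0.99\,\epsilon^2\phi(q)/370)\,(1 + 184\epsilon^2/370^2)^{\phi(q)}$$
$$= \exp\big[\phi(q)\big(-0.99\,\epsilon^2/370 + \log(1 + 184\epsilon^2/370^2)\big)\big]$$
$$\le \exp(-\epsilon^2\phi(q)/751),$$

since for $x \ge 0$, $\log(1+x) \le x$. We have therefore established the bound

$$\mathrm{Prob}[H_q > (1+\epsilon)\phi(q)\log q] \le \exp(-\epsilon^2\phi(q)/751).$$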
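The numerical constants in the last chain of inequalities can be checked directly: with $t = \epsilon/(370\log q)$, the exponent per unit of $\phi(q)$ is $-0.99\epsilon^2/370 + \log(1 + 184\epsilon^2/370^2)$, and $\log(1+x) \le x$ reduces the claim to $0.99/370 - 184/370^2 \ge 1/751$. The sketch below evaluates both sides; the function name is illustrative.

```python
import math

def chernoff_exponent(eps):
    """Exponent per unit of phi(q): -0.99*eps^2/370 + log(1 + 184*eps^2/370^2)."""
    return -0.99 * eps ** 2 / 370 + math.log1p(184 * eps ** 2 / 370 ** 2)

if __name__ == "__main__":
    # The inequality log(1+x) <= x reduces the claim to this comparison.
    print(0.99 / 370 - 184 / 370 ** 2, 1 / 751)
    for eps in (0.01, 0.1, 1.0, 5.0, 9.0):
        print(eps, chernoff_exponent(eps), -eps ** 2 / 751)
```

The margin between $0.99/370 - 184/370^2$ and $1/751$ is small, which explains the specific choice of the constants 370 and 751.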
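We conclude the proof by writing

$$\mathrm{Prob}[\tilde H_q < -V] = \mathrm{Prob}[-\tilde H_q > V] = \mathrm{Prob}[e^{-t\tilde H_q} > e^{tV}] \le e^{-tV}\mathbb{E}[e^{-t\tilde H_q}]$$

and by applying the same reasoning to $\mathcal{L}_{\tilde H_q}(-t)$, from which we deduce that

$$\mathrm{Prob}[H_q < (1-\epsilon)\phi(q)\log q] \le \exp(-\epsilon^2\phi(q)/751).$$

□

For the lower bound we will use the following inequality.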



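Lemma 4.5 (Paley-Zygmund Inequality). If $X \ge 0$ is a random variable having a second moment, then for any $0 \le a \le 1$ we have

$$\mathrm{Prob}[X > a\,\mathbb{E}[X]] \ge (1-a)^2\,\mathbb{E}[X]^2/\mathbb{E}[X^2].$$

Proof. Let $I \subset \mathbb{R}$ be an interval and define the random variable $1_{X \in I}$ as follows:

$$1_{X \in I} := \begin{cases} 1 & \text{if } X \in I, \\ 0 & \text{otherwise}, \end{cases}$$

so that $\mathbb{E}[1_{X \in I}] = \mathrm{Prob}[X \in I]$. Using this notation we have for any $U \ge 0$ that

$$\mathbb{E}[X] = \mathbb{E}[X 1_{X \le U}] + \mathbb{E}[X 1_{X > U}] \le U + \mathbb{E}[X^2]^{1/2}\,\mathrm{Prob}[X > U]^{1/2}$$

by the Cauchy-Schwarz inequality and the fact that $1_{X \in I}^2 = 1_{X \in I}$. The proof follows by taking $U = a\,\mathbb{E}[X]$.

□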
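As a sanity check on Lemma 4.5, the following sketch evaluates both sides of the Paley-Zygmund inequality for a finite discrete distribution, using exact rational arithmetic. The example distribution and the function name are illustrative choices, not taken from the paper.

```python
from fractions import Fraction

def paley_zygmund_sides(values, probs, a):
    """Return (Prob[X > a E[X]], (1-a)^2 E[X]^2 / E[X^2]) for a discrete X >= 0."""
    mean = sum(p * v for v, p in zip(values, probs))
    second = sum(p * v * v for v, p in zip(values, probs))
    lhs = sum(p for v, p in zip(values, probs) if v > a * mean)
    rhs = (1 - a) ** 2 * mean ** 2 / second
    return lhs, rhs

if __name__ == "__main__":
    # X uniform on {0, 1, 2, 3}: E[X] = 3/2, E[X^2] = 7/2.
    values = [0, 1, 2, 3]
    probs = [Fraction(1, 4)] * 4
    for a in (Fraction(0), Fraction(1, 2), Fraction(9, 10)):
        print(a, paley_zygmund_sides(values, probs, a))
```

In the proof of the lower bound that follows, the inequality is applied with $a = \tfrac12$, which is where the factor $\tfrac14$ comes from.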

Proof of Theorem 1.3, lower bound. By Lemma 4.5 we have that if $t$ and $V$ are such that $e^{tV} \le \mathbb{E}[e^{t\tilde H_q}]/2$, then

$$\mathrm{Prob}[\tilde H_q > V] = \mathrm{Prob}[e^{t\tilde H_q} > e^{tV}] \ge \frac14\,\frac{\mathbb{E}[e^{t\tilde H_q}]^2}{\mathbb{E}[e^{2t\tilde H_q}]}.$$

Taking $V = (1+\epsilon)\phi(q)\log q - \mathbb{E}[H_q]$, we need to select $t$ for which $e^{tV} \le \mathbb{E}[e^{t\tilde H_q}]/2$. We start with $\chi$ real. By Lemmas 4.1 and 4.2 we have that in the range $0 \le t \le (100\log q^*)^{-1}$,



E e 



i+«Eiinpi4Eiiy,n+o(5.7E„}(i!^ 

l + mY.?] + '-m.\'] + 0(2.2^ 3(4|t|logg*)3 



4nlogg* V 



2 V (1 -4|t|logg*; 

= 1 + tE[|Fj2] + ^E[|K,r] + O (475(|t| logg*)3) , 

where O means that the implied constant is one. Therefore, in this range of t and for q large 
enough, one shows using the estimates E[|y^p] ~ logg* and e[|K|j,|^] ~ 3(logg*)^ obtained in 
the proof of Theorem O (see dH]) and (^^) that for < t < (2100 log 

E[e*^x] = e-*"[l^-l'lE[e*l^-l'] = 1 + ^(E[|rj4] - ^[\Y^\Y) + O (481(|t| logg*)=') . 
Hence, for q* large enough and for < t < (2100 log g*)~^. 



For complex x, we obtain a similar estimate using f l20l) and ( l25l) . with the constant 0.77 
replaced with 1.54. This shows that 

E[e*^«]= J] E[e*^-] >exp('o.77t2 J] (logg*)^') > e°-^^*''^('')(i°s</)^ (39) 



XGC((j) ^ X mod g 



for g large enough, (log log g)^/ log g < e < 3000 ^ and 1.4e/logg <t < (2100 log g*) ^, since 
Theorem 11.21 shows that in this range, 

(l + e)0(g) logg-E[i7g] ~e(/.(g)logg. 

We conclude by Lemma [4.41 and ( [29]) that 

Prob[i^„ >V]> - ^ ) > ie^-^*^('')('°s 9)^(1 + i84(2tlogg)2)-'^('?). 
Taking t = 1.4e/ logg gives the result. 

□ 



5. Concluding Remarks 

Going from Theorem 1.3 to Conjecture 1.1 is not direct. Indeed, if we are studying the quantity $V(x;q)$ for $q$ and $x$ in a given range such as $(\log\log x)^{1+\delta} \le q \le x^{o(1)}$, then it is not clear that the limiting logarithmic distribution of $V(x;q)$ coincides with that of $H_q$. Indeed one would need to show that in the range $(\log Y)^{1+\delta} \le q \le e^{o(Y)}$, we have for every fixed $m \ge 1$ that

$$\frac{1}{Y}\int_1^Y\big(\phi(q)e^{-y}V(e^y;q)\big)^m\,dy \sim \mathbb{E}[H_q^m]. \qquad (30)$$



This last integral is similar to a $2m$-correlation sum of low-lying zeros of Dirichlet L-functions. Indeed, expanding the $m$-th power we obtain from (12) under GRH that in the range $(\log Y)^{1+\delta} \le q \le e^{o(Y)}$,


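\[
\begin{aligned}
\frac{1}{Y}\int_0^Y \bigl(\phi(q)e^{-y}V(e^y;q) - o(1)\bigr)^m\,dy
&= \frac{1}{Y}\int_0^Y \Biggl(\sum_{\substack{\chi \bmod q \\ \chi \ne \chi_0}} \sum_{\gamma_\chi,\gamma'_\chi} \frac{e^{i(\gamma_\chi - \gamma'_\chi)y}}{(\tfrac12 + i\gamma_\chi)(\tfrac12 - i\gamma'_\chi)}\Biggr)^{m} dy \\
&= \frac{1}{Y}\sum_{\chi_1,\dots,\chi_m \ne \chi_0}\ \sum_{\gamma_{\chi_1},\gamma'_{\chi_1},\dots,\gamma_{\chi_m},\gamma'_{\chi_m}} \frac{\int_0^Y e^{i(\gamma_{\chi_1}+\dots+\gamma_{\chi_m}-\gamma'_{\chi_1}-\dots-\gamma'_{\chi_m})y}\,dy}{(\tfrac12 + i\gamma_{\chi_1})(\tfrac12 - i\gamma'_{\chi_1})\cdots(\tfrac12 + i\gamma_{\chi_m})(\tfrac12 - i\gamma'_{\chi_m})} \\
&= \sum_{\chi_1,\dots,\chi_m \ne \chi_0}\ \sum_{\substack{\gamma_{\chi_1},\gamma'_{\chi_1},\dots,\gamma_{\chi_m},\gamma'_{\chi_m} \\ \gamma_{\chi_1}-\gamma'_{\chi_1}+\dots+\gamma_{\chi_m}-\gamma'_{\chi_m} = 0}} \frac{1}{(\tfrac12 + i\gamma_{\chi_1})(\tfrac12 - i\gamma'_{\chi_1})\cdots(\tfrac12 + i\gamma_{\chi_m})(\tfrac12 - i\gamma'_{\chi_m})} \\
&\quad + \frac{1}{Y}\sum_{\chi_1,\dots,\chi_m \ne \chi_0}\ \sum_{\substack{\gamma_{\chi_1},\gamma'_{\chi_1},\dots,\gamma_{\chi_m},\gamma'_{\chi_m} \\ \gamma_{\chi_1}-\gamma'_{\chi_1}+\dots+\gamma_{\chi_m}-\gamma'_{\chi_m} \ne 0}} \frac{\bigl(e^{i(\gamma_{\chi_1}+\dots+\gamma_{\chi_m}-\gamma'_{\chi_1}-\dots-\gamma'_{\chi_m})Y} - 1\bigr)\bigl(i(\gamma_{\chi_1}+\dots+\gamma_{\chi_m}-\gamma'_{\chi_1}-\dots-\gamma'_{\chi_m})\bigr)^{-1}}{(\tfrac12 + i\gamma_{\chi_1})(\tfrac12 - i\gamma'_{\chi_1})\cdots(\tfrac12 + i\gamma_{\chi_m})(\tfrac12 - i\gamma'_{\chi_m})}.
\end{aligned}
\tag{31}
\]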



If the last sum were running over the zeros of a single L-function, then we would run into the problem that if two zeros $\gamma, \gamma'$ are extremely close to each other, then $(e^{i(\gamma-\gamma')Y} - 1)/(i(\gamma - \gamma'))$ is very close to $Y$, giving a significant contribution to (31). However, in the present situation we are taking an average over all Dirichlet L-functions modulo $q$, and hence the number of pairs of such zeros will be negligible compared to the size of the family we average over, under assumptions on the statistics of zeros of Dirichlet L-functions.
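The role of close pairs of zeros can be read off from an elementary integral. For a real number $a \ne 0$ (standing in for a difference of ordinates; the symbol $a$ is introduced here only for illustration):

```latex
\[
\int_0^Y e^{iay}\,dy = \frac{e^{iaY} - 1}{ia},
\qquad
\Bigl|\frac{e^{iaY} - 1}{ia}\Bigr| \le \min\Bigl(Y, \frac{2}{|a|}\Bigr),
\qquad
\frac{e^{iaY} - 1}{ia} = Y + O\bigl(|a|Y^2\bigr) \quad (a \to 0).
\]
```

So a pair of ordinates at distance much smaller than $1/Y$ contributes essentially as much as a diagonal term, while well-separated pairs gain a factor of about $1/(|a|Y)$ after dividing by $Y$. This is why the off-diagonal sum is naturally split according to the size of the gap.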

We now show how a conjecture on the pair correlation of low-lying zeros of Dirichlet L-functions implies that (30) holds for $m = 1$ in the range $(\log\log x)^{1+\delta} \le q \le x^{o(1)}$. Using Schlage-Puchta's method [SP], one shows that the last term in (31) is an error term for fixed values of $q$, and from this we can conclude under GRH and LI that



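\[
E[H_q^m] = \sum_{\chi_1,\dots,\chi_m \ne \chi_0}\ \sum_{\substack{\gamma_{\chi_1},\gamma'_{\chi_1},\dots,\gamma_{\chi_m},\gamma'_{\chi_m} \\ \gamma_{\chi_1}-\gamma'_{\chi_1}+\dots+\gamma_{\chi_m}-\gamma'_{\chi_m} = 0}} \frac{1}{(\tfrac12 + i\gamma_{\chi_1})(\tfrac12 - i\gamma'_{\chi_1})\cdots(\tfrac12 + i\gamma_{\chi_m})(\tfrac12 - i\gamma'_{\chi_m})}.
\]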



(This actually follows from Theorem 1.3 with the same method as in Lemma 2.5 of [Fi2].) Thus (30) reduces to the statement that the last term in (31) is an error term for every fixed $m$ and for values of $q$ not necessarily fixed. Taking $m = 1$, the last term in (31) is



X mod (J 7x^7^ 
Xt^XO 



(e-fa-7^)v--l)(^^-^;^)- 
(i + nx)(|-^7t) 



E E 

X mod 12 7x^7' 



(l + |7xl)(l + l7'l) 



and we would like to show that in the range $(\log Y)^{1+\delta} \le q \le e^{o(Y)}$ we have $T(Y;q) = o(Y E[H_q])$, that is, $T(Y;q) = o(Y\phi(q)\log q)$. Arguing as in Lemma 2.6 of [Fi2] (this is Schlage-Puchta's technique [SP]), we have, introducing a parameter $U \ge 1$, that

l7x-7;^l>l 0<|7x-7il<l 

0<|7x-7il<l 
7x,7;,<t^ 

We compare the first term with an integral:

T \ ff \og{qx)\og{qy)dxdy ^ 

which is $o(Y\phi(q)\log q)$ as soon as $\log q = o(Y)$ (this holds in our range of $q$). As for the second term, we have by the Riemann-von Mangoldt formula that



11 



yiog7^ (logiqU)y 



_ + )2 ^^^^ f/ 

X mod (jf 7^>?7 ^ ^ 



The third term is the hardest, and requires the following conjecture on the pair correlation of zeros of Dirichlet L-functions:

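Conjecture 5.1. Fix $C > 0$. There exists a bounded function $W(t) \ge 0$ such that in the range $C^{-1} \le S \le \log q$, $1 \le U \le C\log q$ we have

\[ \sum_{\chi \bmod q} \#\bigl\{\gamma_\chi, \gamma'_\chi \le U : 0 < |\gamma_\chi - \gamma'_\chi| \le S/\log q\bigr\} \ll_C \phi(q)\,U\log(qU)\int_0^S W(t)\,dt. \tag{33} \]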

Assuming this conjecture, we have by fixing $\varepsilon > 0$ and using summation by parts that
"xtTd, t'. (l + l7xl)(l + l7;,l) ^f^,^ (l + l7xl)(l + l7;,l) 

0<l7x-7il<Eii Ijj^<!7x-7;,I<1 
7x.7i<C/ 7x-7^<'^ 

log q 

< Y(j){q)e log q + 0(g) log g. 

e 

Collecting all these terms we obtain that 

I + 11 + 111 <^ 0(g) (log g)2 + F0(g)ii^S^^ + r0(g)elogg + 0(g) ^ logg, 

which by taking $U = (\log q)/\varepsilon$ is

« 0(g)(logg)2 + eF0(g) logg + 0(g)M^, 

a quantity which is $o(Y\phi(q)\log q)$ when $q$ is in the range $(\log Y)^{1+\delta} \le q \le e^{o(Y)}$. This justifies why (30) should hold in this range.

One can justify (30) for all $m \ge 1$ with a similar argument, under an assumption on the statistics of zeros of Dirichlet L-functions.

Note that in the range $(\log x)^{1+\delta} \le q \le x^{o(1)}$, knowing the first two moments of $e^{-y}V(e^y;q)$ is sufficient for justifying (jlj). Indeed, Chebyshev's inequality shows that

Prob[|i7, - E[H,]\ > e^{q) \ogq] < 



e20(g)' 



and the following argument gives the desired result in this range of q. 
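The Chebyshev step can be made explicit. A sketch, assuming only the variance asymptotic $\mathrm{Var}[H_q] \sim 2\phi(q)(\log q)^2$ stated in the abstract (the constant recorded in the display above may differ from the one derived here):

```latex
\[
\mathrm{Prob}\bigl[\,|H_q - E[H_q]| > \varepsilon\phi(q)\log q\,\bigr]
\le \frac{\mathrm{Var}[H_q]}{\varepsilon^2\phi(q)^2(\log q)^2}
= \frac{(2 + o(1))\,\phi(q)(\log q)^2}{\varepsilon^2\phi(q)^2(\log q)^2}
= \frac{2 + o(1)}{\varepsilon^2\phi(q)}.
\]
```

For fixed $\varepsilon$ this tends to zero as $q \to \infty$, but only at the polynomial rate $1/\phi(q)$, which is much weaker than the exponential decay in $\varepsilon^2\phi(q)$ coming from the large deviation estimates. That weaker decay is consistent with the two-moment argument needing the larger lower bound on $q$.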

We now show how to support Conjecture 1.1 assuming that (30) holds in the range $(\log Y)^{1+\delta} \le q \le e^{o(Y)}$ (the reason why we chose this upper bound is that in the range

we only need (30) to hold for $m = 1, 2$). Theorem 1.3 shows that under GRH and LI,

\[ \mathrm{meas}\{y \le Y : \phi(q)e^{-y}V(e^y;q) \in (\alpha,\beta)\} \sim Y\,\mathrm{Prob}[H_q \in (\alpha,\beta)], \]
and so in the range $(\log Y)^{1+\delta} \le q \le e^{o(Y)}$, Theorem O gives

meas{y < Y : e"^V^(e^; q) ^ ((1 - e) logg, (1 + e) logg)} 

~ FProb[^q ^ (-e0(g) logg, e0(g) logg)] 
< 2Fexp (-cse^- 



However, since $e^{-y}V(e^y;q)$ can be understood by looking at the equidistribution of the vector $(e^{i\gamma_1 y}, \dots, e^{i\gamma_N y}) \in \mathbb{T}^N$, we expect that the smallest value of $y$ for which $|e^{-y}V(e^y;q) - \log q| \ne o(\log q)$ is about $y \approx \exp(c\phi(q))$. That is to say, for $q \ge (\log y)^{1+\delta}$ we have $e^{-y}V(e^y;q) \sim \log q$, which is equivalent to Conjecture 1.1.

Remark 5.2. Theorem 1.3 and Remark 1.4 even suggest the following estimate, for
(log log 0;)^+'^ < g < 




V{x;q)=xC{q)\^l + 0\^^{x)^^^] |, (34) 

where $\Psi(x)$ is any function tending to infinity with $x$ and

\[ \mathcal{L}(q) := \log q - \gamma - \log(2\pi) - \sum_{p \mid q} \frac{\log p}{p - 1}. \]
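To get a feeling for how far the main term in (34) sits below $\log q$, one can evaluate it numerically. The following is a minimal sketch, assuming the definition $\mathcal{L}(q) = \log q - \gamma - \log(2\pi) - \sum_{p \mid q} (\log p)/(p-1)$; the function names `distinct_prime_factors` and `hooley_main_term` are hypothetical helpers introduced for this illustration and do not come from the paper.

```python
import math

# Euler-Mascheroni constant (not provided by the math module).
EULER_GAMMA = 0.5772156649015329


def distinct_prime_factors(q):
    """Return the distinct prime factors of an integer q >= 1 (trial division)."""
    if q < 1:
        raise ValueError("q must be a positive integer")
    factors = []
    p = 2
    while p * p <= q:
        if q % p == 0:
            factors.append(p)
            while q % p == 0:
                q //= p
        p += 1
    if q > 1:
        # Whatever remains after trial division is a prime factor.
        factors.append(q)
    return factors


def hooley_main_term(q):
    """Evaluate L(q) = log q - gamma - log(2*pi) - sum over primes p | q of log(p)/(p-1)."""
    correction = sum(math.log(p) / (p - 1) for p in distinct_prime_factors(q))
    return math.log(q) - EULER_GAMMA - math.log(2 * math.pi) - correction


if __name__ == "__main__":
    # Compare log q with L(q) for a prime modulus and for moduli with many small prime factors.
    for q in (101, 210, 1009, 30030):
        print(q, round(math.log(q), 4), round(hooley_main_term(q), 4))
```

The correction sum is largest for moduli with many small prime factors (such as 210 or 30030), while for a large prime $q$ it is only $(\log q)/(q-1)$, so $\mathcal{L}(q)$ then differs from $\log q$ essentially by the constant $\gamma + \log(2\pi)$.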

Remark 5.3. It would be interesting to investigate the large deviations of $H_q$ in Theorem 1.3 for larger values of $\varepsilon$. Indeed we believe that a transition happens near $\varepsilon \asymp 1$, and this could give information about $V(x;q)$ in the range $q \le \log\log x$. For example one could make a prediction on the best possible bound for $V(x;q)$ in this range. If $q$ is fixed, then one can show using (11) that the limiting distribution of $e^{-y}V(e^y;q)$ has double-exponentially decaying tails (this follows from Montgomery's work [Mo2]), resulting in the prediction

V^(x; g) ^ x(logloglogx)''. (35) 

Again this is for fixed values of $q$, and shows that a transition happens in the range $1 \le q \le (\log\log x)$, in transferring from (34) to (35).
Appendix A. Some comments on Montgomery's Conjecture
Montgomery's Conjecture [Mo1] states that if we fix $\varepsilon > 0$, then

^{x]q,a) = —— + OJ — 

uniformly for all coprime integers $a, q \le x$. The Generalized Riemann Hypothesis implies that for $\chi \ne \chi_0$,

\[ \psi(x,\chi) := \sum_{n \le x} \Lambda(n)\chi(n) \ll x^{1/2}(\log x)^2, \]



and thus using orthogonality relations we obtain the following standard estimate: 



Montgomery's Conjecture is motivated by the fact that we did not exploit any cancellations in the last estimate. Indeed, since under GRH and LI the quantity $\overline{\chi}(a)e^{-y/2}\psi(e^y,\chi)$ is distributed like a random variable of mean zero and variance asymptotically $\log q^*$, we have that the limiting distribution of $e^{-y/2}\sum_{\chi \bmod q,\ \chi \ne \chi_0} \overline{\chi}(a)\psi(e^y,\chi)$ has variance asymptotically $\phi(q)\log q$.
This means that this last sum is normally of order {(f){q) logg)2+^, which in turn gives that 

ijj{x; q, a) — "^^^^'^^"^ is normally of order X2 that is Montgomery's Conjecture should 

hold. 

As for the quantity $\psi(x,\chi)$, we do not expect any improvement under GRH of the form

'^{x,X) < 

for any fixed $\delta > 0$. Indeed this last bound can readily be disproved under GRH using either (5) or (3) (in the range $x/(\log x)^A \le q \le x$ it can even be disproved unconditionally using (1)). Moreover, we believe that it is interesting to see what Montgomery's Conjecture implies for this quantity, using the orthogonality relations. Indeed under this conjecture we have for $\chi \ne \chi_0$ that



\[ \psi(x,\chi) = \sum_{\substack{a \bmod q \\ (a,q)=1}} \chi(a)\Bigl(\psi(x;q,a) - \frac{\psi(x,\chi_0)}{\phi(q)}\Bigr) \ll \phi(q)\,\frac{x^{1/2+\varepsilon}}{q^{1/2}} \le q^{1/2}x^{1/2+\varepsilon}, \]

which is worse than GRH. Again, the discrepancy between this and the 'true bound' $\psi(x,\chi) \ll x^{1/2}(\log x)^2$ comes from the fact that square-root cancellation occurs in the last sum.
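The gap between the two bounds can be seen by comparing the triangle inequality with a square-root cancellation heuristic. In the sketch below, $\Delta(x;q,a) := \psi(x;q,a) - \psi(x,\chi_0)/\phi(q)$ is shorthand introduced here for illustration, and the second line is a heuristic expectation rather than a proved statement:

```latex
\begin{align*}
\text{triangle inequality:}\quad
 & \Bigl|\sum_{(a,q)=1} \chi(a)\,\Delta(x;q,a)\Bigr|
   \le \phi(q)\max_{(a,q)=1}|\Delta(x;q,a)|
   \ll \phi(q)\,\frac{x^{1/2+\varepsilon}}{q^{1/2}}
   \le q^{1/2}x^{1/2+\varepsilon}, \\
\text{square-root cancellation (heuristic):}\quad
 & \Bigl|\sum_{(a,q)=1} \chi(a)\,\Delta(x;q,a)\Bigr|
   \approx \phi(q)^{1/2}\max_{(a,q)=1}|\Delta(x;q,a)|
   \ll \phi(q)^{1/2}\,\frac{x^{1/2+\varepsilon}}{q^{1/2}}
   \le x^{1/2+\varepsilon}.
\end{align*}
```

Under the heuristic the factor $q^{1/2}$ disappears, which brings the estimate back in line with the GRH bound up to the factor $x^{\varepsilon}$.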